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ABSTRACT - . 

At a workshop exploring practical methods that 
elementary ai^d secondary teachers may use to analyze texts, three 
parallel texts vtere subjected to three forms of analysis: subjective 
analysis, examination of cohesive ties, and linguistic analysis of 
selected text variables such as word and sentence length. The 
subjective ranking of the passages' difficulty illustrated the varied 
factors on which individual judgments are based. Calculation of the 
percentage of grammatical and lexical ties per word and per sentence 
failed to correspond to workshop participants' subjective judgments 
of difficulty, suggesting that. the connection between cohesion and 
comprehensibility neecls to be investigated further. Rix 
scores—obtained for each passage by counting the number of long 
words, dividing by the number of sentences, and multiplying by - 
100 — corresponded roughly to the subjective ratings. While providing 
an objective measure, the. Rix scale's equation of sentence and word 
length with sentence and word complexity is questionable. Although 
these measures are indirect, they may nevertheless provide teachers 
with help in critically evaluating classroom texts. (MM) 
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The purpose. of this workshop session was to examine practical procedures 
that teachers, both primary and secondary, may use to Analyse texts. Vl^a 
commonly; employed procedures for assessing either reading comprehension or 
the comprehensibillty of tej^ts focus on word factorrs (tapping lexical , 
difficulty) arid sentence factors (tappiiig syntactic complexity). However, 

as teachers know well, while many a Reader, native speaker and.second- 

.... . ■ 

language leaxher, may be familiar with the lexical items and syntactic' 
structures within a text, difficulties are often still experienced with 
coraprehen^iing the/^text as a whole. More recently, researchers have provided 
^ framework for Ipokiug a1: factors beyond the sentence, termed cohesive ties, 
that link t^xt tdgether providing continuity and texture. ^ 

, The workshop providted opportunities to analyse texts in terms of the \ 
l^cohesive properties and other linguistic f actorsN^hich may serve to. anticipare 
the ea^e or difficulty of reading them. ^ 

\ '•'^ 
\ . ^ 

Subjective analysis • , ^ \ ' 

Three parallel texti (see next page) were distributed and participants 
were invited to rank these in order of difficulty as vthey might appear to ^ 
young readers. There was ^ome discussion on the group rank orderings in an ^ 
effort to discover what aspects of texts^ teachers focused in making their 
judgments. Although th^re was ^ar from perfect agreement, the majority view 
placed Pass^e C as most difficult and Passage B afe easiest. The discussion 
illustrated the subjective nature of global judgment as well as the fact 
that individual judgments were often based on a variety of different factors. 
<rKe major purpose of th6 exercise was not to -disparage subjective an*alyses 
but to begin to focus attention on the major characteristics of text. 



Three parallel texts to be placed on next page 
following first line of abova paragraph 



It was readily apparent that the register was constant acr^s texLa. 
In other words, not only was the story the same but the storyteller's 
purpose was similar in each case, as was the style. In.H^lliday's Lei.mo, 

/ 

\ 

J 



the^ field, mode ^d tenor were similar for 4II three texts. ^This meant, then,, 
that if the texts did indeed differ in difficulty thexi such difficulty could 
be attributed ^ssentially to text variables rather than factors of style or genre. 

■ ., _ /•• .■ 

Cohesive ties ' ' * • 

• * . ■ ■ / 

Attention was directed first to cohesive ties, the semantic links which 
according to Halliday and Hasan (1976) "make a text a text" (p. 1,3). Another 
writer has said of Coli^^ve ties: 

... the cohesive ti'es which bind or bridge sentences semantically 
eftiable the reader to establish , coherence in text and therefore 
account for a large portion of ... comprehehsibility in, written 
discourse (Moe 1979, p. 16). 

To illustrate how linguistic ties quite literally hold text together, 
like "a kind of linguistic mortar" as Tiemey and Mosenthal (1980) expressed 
it, a tracing of the ties in the nursery rhyme of Old Mother Hubbard was made: 



Old Mother Hubbard rhyme about here 

r 1 



Halliday and Hasan (1976) in their pioneering book Cohesion, in English 
identified five major categories of cohe^on. These categories - reference, 
substitution, ellipsis, conjunction and lexical cohesion - were in turn each 
sub-divided into two or more sub-categories as illustrated in the Figtire 



below. ' . ^ ' 
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Halliday and Hasan's categories of cohesion about here 



tor those untarailiar with the concept of cohesion the following description 
from Anderson (1982) illustrates the major type^ and sub-categories: 

1 , Reference \ ^ 

Reference, acs ilie name suggeaLb, tequltea reference claewlic:ie In Lht 
text for interpretation of one iLem in the tie. Halliday and Hasan 
identify three sub-categories of reference, 
a) Personal reference 

e.g. 1. Joan wciiL Lo town. 

2. She Loughi herself a haL. 
\ ' (She and herself refer to Jo-^^) . 



This sub-categorj^ Includes the personal^ pronouns, possessive 
adjectives and possessive pronouns (I, me, mine, mjr, he^ him, his , 
it , its , one , one' s etc.) 

b) Demonstrative reference ' 
e.g. 3. jQan went to town. 

4. There she bought a hat.- v 

5, The hat was a delicate shade of pink. 

(There refers to ^town and the to the same hat as in 4). 
This sub-category includes the demonstratives (this, that, these, 
those , here , there , now, then , the ) . 

c) Comparative reference - > 
e.g. 6. Joan bought a hat. 

7. It is the same as mine. 
(same refers to hat ) . 
This sub-category includes comparative adjectives and adverbs 
" (some, identical , similarly , other, better, more, so etc.) - 

Substitution • 

In substitution one item (usually one, do or s&) substitutes for another. 
Halliday and-Hasan describe three sub-categories .of substitution 
depending on whether the substitute functions as % noun, verb or clause. 

a) Nominal substitution 

e.g. 8. Robert got a new bicycle for. Christmas . 
9. His old one was broken. 

( one substitutes for bicyc le ) 
The words occurring as nominal substitutes ^re: 6iie ^ ones ; same 

b) Verbal substitution 

e.g. 10. Robert got a new bicycle. 
11. I dldvtoo. 

(did substitutes for got a. new bicycle ) 
The verb do is the only verb identified by Halliday and H^an^that 
occurs as a verbal substitute. . 

c) Clausal substitution 

e.g. 12. Is Robert getting a new bicycle? ' ^ 

13. I hope so. y 

(so substitutes for Robert is getting a iipx^^ilcy^x^) 
Along with so, not can substitute for a clause. ( 
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Ellipsis 

Ellipsis is father like substitution and is in fact described by 
Halliday apd Hasan as "substitution by zero'*. As in substitution, 
three^ sub-categories are delineated depending on whether the ^presumed 
item functions as a noun, verb or clause. 

a) Nominal ellipsis 

e.^. 14. Mary had a chocolate milkshake. 
15.* Jean has a vanilla. 

* . .(a vanilla milkshake is presumed), 

b) Verbal ellipsis ^ ' ' 
e.g. 16. Did Mar^? do away? 

17. She may have. ^ , ^ r 

' (gQ^Q away is presumed) . 

c) Clausal ellipsis 

e.g. 18. , Mary was shaving a milkshake. 
19. Who^ was? 

(having a, milkshake is presumed) . ^ 

Conj unctiqn 

Conjunctions 'serve? to connect one part of a text with another and 
thus form cohesive ties. Halliday and Hasan (1976) describe such 
relations as Indirect in that conjunctions '"express certain meanings 
which presuppose the presence of other components in the discourse" 
(p. 226). They Identify four sub-categories. 

(a) Additive (and , also, nor, in addition , by the way etc.) 

(b) S^^dversative (but, ^et, though , however, on the other hand etc.) 

(c) Causal (so, hence , as a result , there foife etc.) 

These four sub-categories are illustrated in the following passa-ge 
(pp. 238-239). 

14. For the whole day he climbed up the steep mountain- 
side, almost without stopping. 

15. And in all this time he met no one. Additive 



16. Yet he was. hardly aware of being tired. Adversative 

17. So b^r night-time 'the^,valley was far 

below him. , ' Causal 

18. Then, as dusk fell, he sat down to 

Temporal 
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5. Lexical cohesion > ' 

Reference, substitution, ellipsis and conjunction are types of cohesion 
that are "realised" through the grammar. A fifth type of cohesion is ' 
realised through the vocabulary. Halliday and Hasan term this lexical 
cohesion. They identify two sub>-categories . \ 

a) Reiteration ' . 

i.e^. the repetition of the same item, or the use of a synonym (or 
near synonym), a superordinate, or a general noun. For example: 
19a I turned to the ascent of the peak. ' 

20. The ascent is perfectly easy. Repetition 

21. The climb is perfectly easy. ^ Synonym 

22. The task is perfectly easy. Superordinate 
^ 23. The thing is perfectly easy. General noun 

b) Collocation 

i.e. ' the association of items that regularly co-occur" (Halliday 
and Hasan, p. 285). For exa^mple, night may be associated with day, 
book with newspaper , private with sergeant, curtain with window. . .etc • 

Analysis of cohesive ties 

As part of- the workshop the cohesive ties in each of the three parallel 
texts wer^ identified. Just the overall results are presented here. Thus 
Table 1 presents, for each of the three texts, the total pattern of cohesive 
ftes, sub-divided into grammatical ties (reference, substitution and ellipsis) 
and lexical ties. Following Hasan (1980), conjunctjLons were omitted from ^ 
this part of the analysis. 

L 

There is no reason, of course, why the total number of ties, or for that 
matter the number of grammatical or lexical ties, should show any variation 
across texts sincfe these counts would be a function of text length. Accordingly, 
the individual counts were first expressed as percentages. Examination of 
Table 1 shoWs^ that when this was done the pattern in Passages B and C appear 
very similar with the breakdown of grammatical to lexical ties being approximately 
60:40 whereas in Pas'sage A it is nearer 70:30. In other words. Passage A is 
the one that is different though whether this is a significant difference or 
not is unclear. 



^ Another way of taking text length Into acoeunt Is to express number of 
ties as a function of number of words "or number rf)f sentences. When this was 
done,: as Is also seen In Table 1, Passages A and C seem remarkably similar 
-with Passage B being the odd one out. The metric, ties per word, Is a measure 
of the tie density.. Passage B on this metric has the greatest tie density. 
Is tie density related to reading difficulty? It could ^ejha^s he. argued 
that the greater number of ties or connections ^readers need to make, then 
the more complex the reading task and hence tie density Is a measure of reading 
difficulty. On this reasoning. Passage B would' be the most difficulty. of the 
threr^assages to read^.and the remaining two -passages would be\pproxlmately > 
equally difficult. On the other hand, perhaps It could be argued that the 
greater the rate of cohe^on the more tightly knit Is the text and maybe this 
Is really an Indicator of reading ease. On- this line of argument, Passage, - 
B would be the easiest to read. 

The measure ties per sentence has perhaps greater grounds for being 
related to reading difficulty for practically every readability formula Is 
based on the notion that sentence length Is a measure of syntactic complexity. 
The argument here would be that shorter sentences contain fewer words and 
thus WOUI4 be expected to have a lower rate of ties per sentence. Certainly, 
the rate of- ties per sentence for Passage B of 4.00 appears to be considerably 
lower than the rates for either pf the other two passages (7.00 and 7.75 
respectively) . This line of reasoning would suggest that Passage B Is the 
easiest to read with little dlfferehcg In reading dlfflcuJ^y between Passages 
A and C. • 



Table 1 about here 



^^Ll^ifi :±L <->t:iier aele cLed text yarlable& 



i'he next titage was Lo focub attenLion on inoLc Liaditionai 
la an efrurt to see whether analysis of these variables would ^,hed lurthet 
light on the tentative results of the cohesion^ ancilysla or on the gtuu 
subjective ratings. 
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a' count was^herefore made of the- number of words, long words (defined 
as containing more than 6ix letters), syllables, monosyllables and sentences 
sincj^ these are the basic measures included in most readability formulae 
(see Table 2). Formulae incorporating percentage of syllables, for example, 
and sentence length are those of Flesch and Fry. Possibly more useful 

1 

than either of these- is a new formula Lix^and a super-fast version Rlx, ^ 
(Anderson 1983). Li^c is defined simply as follows: 

. Lix = word length + sentence length 

where word length is percentage of long words 

and sentence length is average length of sentence in words 

Table 2 includes these two calculations for the three texts^ and summing 
gives Lix scores of 30'. 1, 20.5 and 3Cf.5 respectively for Passages A, B and 
C. Passage B on th^ Lix criterion is thus seen to be the easiest and Passages 
A and C >of the^ approximately equal difficulty. ' ^ 



Table 2 about here 



Before trying to |.nterpret theserLix scores, let us note one of the 
major advantages of Li;K over most other readability estimates. By examining 
the compoiients of-^lx,i the relative contributions of sentence and woifd 
factors to estimated reading difficulty are readily apparent. Thus for 
Passage A (and somewhat similarly for Passage C) the sentence factor (20.2) 
contributes approximately twice as much to reading difficulty as the word < 
factor (9.9). Furthermore, these factor weights are themselves readily 
interpretable. A sentence factor of 20.2, for instance,^ indicates an average 
sentence length of about 20 words; while a word factc^r of 9.9 indicates that 
about 10 per cetit of the words, that is an' average of one word in ten, are 
long words. Passage B, by contrast;, .has the word factor as the major contributor 
to reading difficulty (approximately 12/20 or 60 per cent) compared witb the 
contribution of the sentence factor (approximately 8/20 or 40 perjLzent). r 
Again, these factor weights are readily interpretab3^e . "~ 

* /' 

\ 

While Lix is tvery e^sy to calculate, its close relative Hlx Is even 
easier. The full directions and interpret^ appear on the next page. All 
that is required is to count the number of long words, divide by the numbeL 



of sentences and multiply by 100. The interpteter allows the user to enter 
the Rlx score in the Table and read of f the 'equivalent grade level . ^ Thus 
for Passage A: - 

, Rix « (10/5) X 100 ^ ; 

= 200 (equivalent to Grade 6) 

Passage B is estimated at Grade A level ^nd Passage C at Grade 6 level. 



Discussion 



1 • / 

Estimating Reading Difficulty with Rlx' on next 
full page following second sentence in above 
paragraph 
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In the analysis of afty particular text it must be largely left to the ^ 
user as ^to which analysis is the most appropriate for a particular purpose.' 
The following is a brief coramenti on edch form of analysis. 

With subjective ^ratings users are not limited tp making mechanistic ^ 
counts. Theirs can be a global Judgment based on accumulated experience 
of children and of texts, and taking into account a variety of factors ^ 
thought to contribute to^reading difficulty or ease. If a number of judges, 
is avail,able, then an average Judgment is clearly more reliable than any 
single Judgment. The difficulty with subjective ratings, of course, is that 
a group of judges is not always available; judges are not always conscious 
of which factors they tade into account; and ratings across J judges can 
sometimes vary quite widely. In the present exercise with the three parallel 
texts of Aesop's fable, while there was general consenaiis that Passage B was 
the easiest to read, there was le^s agreement about the relative reading 
difficulty of Passages A and C, though most were Incj^ined to rat^ Passage C 
as harder than Passage A.- ^ 

The cohesion analysis provides add4tl[^i5nal information about texts that 
may sometimes be useful in explaining unexpected sourcea of reading difficulty 



For many t<iac^ers the concipet of cohesion is relatively new. Intuitively, 
the notion of semantic links without which there would not be text but, rather 
a Jumbrfe of sentences, - is an attractive one. Much more work needs to bet dpne ^ 



iri this'area though an important start has been made (see, for example, the . 
-March 1983 issue of. the Australian Journal of Reading which had as its. special 
focus cohesion and the reading teacher). The analyses reported above, where 
percentage of grammatical ^nd Igxical ties^ and ties. per word and per sentenT:e 
were calculated must he treated wi^h caution. Insufficient is known about, 
the relationship of cohesion and comprehensibility and although certain trends 
emerged with the three parallel texts, these must be thought of only as vfery • 
tentative' hypotheses to be explored in the analysis of further texts. Hasan 
(in press) takes this kind of analysis much further in her examination of 
central -and relevant tokens* and the interaction of chains of cohesive ties. 

^.T^e analysis of^selected linguistic variables, aa described ahoy.e, is 
a well-trodden path and, although the particular instruments described here 
(Lix and Rix) are relatively new, tney are based on a research paradigm gQing * 
back more than 50 years. The measures are quite objective, by whdch is meant 
dif ferent^analysts would arrive at the same results, as indepd would a computer. 
However, just because Lix an^i Rix come up with quantifiable indices, there 
may be a danger of regarding these as more precise than they really aire. Such 
indices may be useful in providing estimates of difficulty but the user mtasjt , 
always remember the underlying assumptions on which such estimates are bashed, 
namely thatL>tfTTtence length and word length are themselves Substitutes Jor 
sentence complexity and word complexity, variables very difficult to measure 
more precisely and directly. ' 

Finally, all the analyses discussed above are indirc^ct measures for 
none takes the reader directly into account. It could be argued that such 
Indirect mieasures therefore lac}c validity. This is qu issue about which there 
will always be considerable controversy. The view taken here Is that, indirect 
measures thpugh they may be, they may nevertheless provide the teacher with 
useful insights in critically evaluating texts for use in the classroom. 
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" Passage A; The Dgg and His? Shadow ; ' ^ 

It happ'ened that^a dog had got a piece of meat and was farrylngv±t home In his 
- mouth to eat. Now on his way hom^ he had to cross* a pXank, lyings across a 
• running brooki As he crossed, he* looke"^ down and saw his own shadow reflected 
Tj^^^^^^?-^'^^^^ ^^^ ^^8 it jw^is anp ther _of ■ m(Mfc^ L 

made lip also. So he mad^ a snap at the shadow, but ' 

?8 ^ oiW®!^ hls mouth the piece of meat.f^ll' out , dropped Into th^ water, and. 
was neVer seen.mgre. . I h - 

' : "Passalge B: The- Dog and Hi^ Borte • r 

A, little dog hurried to the stream iW.th a large Juicy bone In his mouth. He 
A ^wanted to- eat the bone all by himself. So he r^n across a log that bridged 
the stream. Theji, In the water, he saw a picture of himself . But he thought 
-It was another dog.' "Ah, no|r I shall liave two nice bonetf to eat'^ thought the 
l:greedy rittle dog. He growled and snapped at the other bone./ .SPLASHl His 
bone^ fell' In the \yter. And so did hel Now* he had nothing to eat. 

a 

Greedy-gree^ makes a hungry puppy. 



Passage C: The Dog and the Shadow ' , ' ' " 

A dog, crossing a bridge oyer a stream with', a piece of flesh In his mouth saw 

» ■■ ' • , , « . , • ■ * • ■ 

his own shadow In the water, and took It for knother dog, with a piece of meat 

double his own In size. He therefore let go his Own, and fiercely attacked 

the other )iog, to get his larger piece from him. He thus lost, both - that 

which he grasped. at In the water, because It was a ^shadow; and his own 

because the stream swept It away. ^ r * 
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't:h6 cupbi^ard 



was 



bare. 



And 



so 



the poor dog 



the cupboard 
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got 




none 
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ties^ 



r- Reference 



— Substitution- 



— Ellips4.8 * ^ 



Personal 
demonstrative 
con&paratlve ^ 



X 



f 



nominal 

verbal 

clausal 



P additive (^and') 
- adversative ^(*but') 



— Conjunction - 



causal (*so') 
' temporal: :(^t^en' ) 



. .Lexical' 



P Repetition 
- .synondmy 



^ " supetordinate 



xrollocatlon 



« general nouns 



(after Ealliday and Ha^n 1976) 



EKLC 



14 



TABLE 1: CoheQive Ties 



Passage^ A 



Passage B 



Passage C 



1. 



No* of grammatical ties 
No» of lexical ties 
Total no. of ties 

% of grammatical ties 
*Z of lexical ties ^ 

Noi of words 
/ Np of sentehces , 
Ties per word. 
Ties per sentence 



r 



25 
10 
35 

71 
29 



dOl 
. ■ 5 

.35 
7.010 



<29 
19 

•48 

60 
40 

96 
12 
.5b 
4.00 



19 
12 
31 

61 

39 

84 
4 

.3.7 
7.75 



TABliE 2: Selected Linguistic VaJ|iables 



Passage A Passage B Passage C- 



No* of words 
No. of long words . 
No. of syllables 
No. of monosyllables 

No. of sentences 
Sent, length (words) 
% long words 



101 




96 V . 


84 


10 




12- 


8 


120 




,116 


103 


84 




79 . 


68 


5 




12 


4 


20.2 




8.0 


21.0 


9.9 




1^.5 


9.5 



ESTIMAf ING READING DIFFICULTY WITH RIX* 

Developed by Jonf than^ Aiaderson (Flinders University of Soutm Australia) 

'..'/■■ • *\ 

i)irections ' * * ' ■ 

!• Select a sample of sentences from the book(s) to be analysed. * It is ndt 

possible to be precise about the, number of samples since. this depends < in 

part on the size of the book(s) and in part on the coSlhlstency of 
writing*. As a guide, for short tex^, ten. 10-sentence samples .taken 
regularly through the book may be sufficient; Wiile for •longer -works, 
samples of at least twice this size will probably •bfe required. V^ry 
short texts may be analysed in their entirety. \' " '/ 

2. For each total sample (excluding headings, captions, etc. «) ' -J 



3. 



(a) Count the number of sentences 

A sentence is defined generally as *a sequence of Words termiliati^dt by 
a full-stop, question or exclamation ' mark,' colon- pr *semi-colpn. 
\ However, in direct speech, sequences like "Where? "V he asked l^nd 
"Gol" he ordered , count as single sentences^ Y ^ 

■ . ■ ^ "A - , " 

(b) Count the number of long words (i.e. words of 7 or more characters 

after excluding hypens, punctuation marks and brackets)\ _ 

A word is defined generally as a sequence of, c^iaracWrs bound 
white spaces. Thus numbers like 1,46X 4i^d 1^-2, hyph; 
sequences^ abbreviations (^.g. \ IRA, a.m.), d^es \ sue 
(1981-1982), and symbols like % codia^t^ as single words. 

Determine Rix by dividing the number of 
sentences and multiplying by 100. 



lo^ng words by the number 



Interpretation 




1. To find the equivalent grade level of difficulty for Rix, enter the Table 
below and locate t1ie corresponding grade. ::r> ; ^ 



Rix score 


Rix Krade level 


0-19 


1 


20-49 


2 


50-79 


3 


80^129 


4 . 


130-179 


5 


180-239 


■■: ' 6 . 


240-299 ■ 


. 7 - 


300-369 


.8 


370-449' 


.9 


450-529 


10 


. 530-619 


11 . 


620-719 


. , . ' 12 ' 


• 720+ 


College 
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